AITopics | cumulative payoff

Collaborating Authors

cumulative payoff

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Almost Optimal Algorithms for Linear Stochastic Bandits with Heavy-Tailed Payoffs

Han Shao, Xiaotian Yu, Irwin King, Michael R. Lyu

Neural Information Processing SystemsNov-20-2025, 14:42:20 GMT

In linear stochastic bandits, it is commonly assumed that pa yoffs are with sub-Gaussian noises. In this paper, under a weaker assumption on noises, we study the problem of lin ear stochastic b andits with he avy-t ailed payoffs (LinBET), where the distributions have finite moments of order 1+ ϵ,f o rs o m e ϵ (0, 1] .W e rigorously analyze the regret lower bound of LinBET as Ω( T

artificial intelligence, data mining, machine learning, (18 more...)

Neural Information Processing Systems

Country:

Asia > China > Hong Kong (0.04)
North America > Canada > Quebec > Montreal (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.94)
Information Technology > Data Science > Data Mining > Big Data (0.48)

Add feedback

Selection pressure/Noise driven cooperative behaviour in the thermodynamic limit of repeated games

Tah, Rajdeep, Benjamin, Colin

arXiv.org Artificial IntelligenceJul-22-2024

Consider the scenario where an infinite number of players (i.e., the \textit{thermodynamic} limit) find themselves in a Prisoner's dilemma type situation, in a \textit{repeated} setting. Is it reasonable to anticipate that, in these circumstances, cooperation will emerge? This paper addresses this question by examining the emergence of cooperative behaviour, in the presence of \textit{noise} (or, under \textit{selection pressure}), in repeated Prisoner's Dilemma games, involving strategies such as \textit{Tit-for-Tat}, \textit{Always Defect}, \textit{GRIM}, \textit{Win-Stay, Lose-Shift}, and others. To analyze these games, we employ a numerical Agent-Based Model (ABM) and compare it with the analytical Nash Equilibrium Mapping (NEM) technique, both based on the \textit{1D}-Ising chain. We use \textit{game magnetization} as an indicator of cooperative behaviour. A significant finding is that for some repeated games, a discontinuity in the game magnetization indicates a \textit{first}-order \textit{selection pressure/noise}-driven phase transition. The phase transition is particular to strategies where players do not severely punish a single defection. We also observe that in these particular cases, the phase transition critically depends on the number of \textit{rounds} the game is played in the thermodynamic limit. For all five games, we find that both ABM and NEM, in conjunction with game magnetization, provide crucial inputs on how cooperative behaviour can emerge in an infinite-player repeated Prisoner's dilemma game.

grim, payoff, thermodynamic limit, (14 more...)

arXiv.org Artificial Intelligence

2407.15801

Country:

North America > United States > New York (0.04)
Asia > India > Maharashtra > Mumbai (0.04)

Genre: Research Report (0.50)

Industry: Leisure & Entertainment > Games (1.00)

Technology:

Information Technology > Game Theory (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.34)

Add feedback

Multi-agent Cooperation in Diverse Population Games

Wong, K., Lim, S. W., Gao, Z.

Neural Information Processing SystemsDec-31-2005

We consider multi-agent systems whose agents compete for resources by striving to be in the minority group. The agents adapt to the environment by reinforcement learning of the preferences of the policies they hold. Diversity of preferences of policies is introduced by adding random biases to the initial cumulative payoffs of their policies. We explain and provide evidence that agent cooperation becomes increasingly important when diversity increases. Analyses of these mechanisms yield excellent agreement with simulations over nine decades of data.

agent, attractor, cumulative payoff, (15 more...)

Neural Information Processing Systems

Country: Asia > China > Hong Kong (0.05)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)

Add feedback

Multi-agent Cooperation in Diverse Population Games

Wong, K., Lim, S. W., Gao, Z.

Neural Information Processing SystemsDec-31-2005

agent, attractor, cumulative payoff, (15 more...)

Neural Information Processing Systems

Country: Asia > China > Hong Kong (0.05)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)

Add feedback

Multi-agent Cooperation in Diverse Population Games

Wong, K., Lim, S. W., Gao, Z.

Neural Information Processing SystemsDec-31-2005

We consider multi-agent systems whose agents compete for resources by striving to be in the minority group. The agents adapt to the environment by reinforcement learning of the preferences of the policies they hold. Diversity of preferences of policies is introduced by adding random biases tothe initial cumulative payoffs of their policies. We explain and provide evidence that agent cooperation becomes increasingly important when diversity increases. Analyses of these mechanisms yield excellent agreement with simulations over nine decades of data.

agent, artificial intelligence, attractor, (16 more...)

Neural Information Processing Systems

Country: Asia > China (0.14)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)

Add feedback